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(54) Picture compression process 

(57) The invention relates to a picture compression 
process in which each picture macrobiock is subjected 
to the coding chosen, for each macrobiock, from among 
several coding types. 

To select the coding type to be applied to the mac- 
robiock, the cost (Cj) or a quality factor (MB.Quality) is 



determined, by trial codings, for each coding type i. The 
coding selection is made as a function of the values of 
the cost or of the quality factor or of the pairs of values 
of these parameters. 

Application to MPEG coding, especially MPEG2 
coding, so as to deliver pictures of constant quality and/ 
or good quality/cost ratio. 
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Description 

[0001] The invention concerns a picture compression process in which each picture or picture macroblocic is sut>- 
jected to a coding chosen from among several coding types. 
5 [0002] It relates more particularly to the MPEG2 compression process. Although the invention is not limited to this 
type of compression, in the description hereafter, we shall refer mainly to this type. 

[0003] Recalled below is the principle of such compression and. at this juncture, the coding types which must be 
selected for each macroblocl^ will be indicated. 

[0004] In the MPEG2 standard, it is possible to start from a picture containing in progressive mode 576 lines of 720 
10 points each, in interlaced mode, this picture is composed of two frames each of which comprises 288 lines, also of 
720 points each. 

[0005] Each picture Is split up into macroblocks, each of which is formed by a square of 16x16 luminance points. 
Each macroblock is thus formed of 4 square blocks of 8x8 luminance points. With each of these 4 luminance blocks 
there are associated (in the 4.2.0 format) two chrominance blocks each of which has 8x8 points, one of the blocks 

15 representing the colour difference or red chrominance signal Cr and the other block representing the colour difference 
or blue chrominance signal Cb. In the 4.2.2 format, with each luminance macroblock there are associated four 8x8 
chrominance blocks, 2 blocks for the blue chrominance and 2 blocks for the red chrominance. There is also a 4.4.4 
format In which each of the luminance and chrominance components each comprises 4 blocks of 8x8. 
[0006] Represented in Figure 1 are four 8x8 luminance blocks, with reference 1 0 within the set and 8x8 chrominance 

20 blocks 12 and 14 for the blue and red chrominances respectively, the set illustrating a macroblock in the 4.2.0 norm. 
[0007] Each block is coded by using a transformation denoted DCT which is a discrete cosine transformation which 
makes it possible to transform a luminance block (for example) into a block of coefficients representing spatial frequen- 
cies. As may be seen in Figure 2, a source block 16 Is transfbnnned into a block 18 of 8x8 coefficients. The upper left 
comer 20 of the block 18 corresponds to the zero spatial frequencies (mean value of the block) and. onwards of this 

25 origin 20, the horizontal frequencies increase towards the right, as represented by the anxjw 22, while the vertical 
spatial frequencies increase from top to bottom, as represented by the arrow 24. 

[0008] For each macroblock, it is necessary to choose the coding type: either "Intra" or "inter^. Intra-coding consists 
In applying the DCT transformation to a source block of the picture, while Inter-coding consists in applying the OCT 
transformation to a block representing the difference between a source block and a predicted block, or prediction block, 

30 of a preceding or following picture. 

[0009] The choice depends in part on the type of pictures to which the macroblock belongs. These pictures are of 
three types: the first type is the so-called I or Intra type, for which the coding Is intra for all the macroblocks. 
[0010] The second type is of P or prediction type; in pictures of this type, the coding of each macroblock can be 
either intra or inter. In the case of inter-coding on a picture of P type, the DCT transformation is applied to the difference 

35 between the current macroblock of this picture P and a prediction macroblock arising from the preceding I or P picture. 
[0011] The third type of picture is called B or bidirectional. Each macroblock of such a picture type Is either intra- 
coded or inter-coded. Inter-coding consists also In applying the transformation to the difference between the current 
macroblock of this B picture and a prediction macroblock. This prediction macroblock may arise either from the pre- 
ceding picture or from the following picture or from both at once (bidirectional prediction), it being possible for the so- 

40 called preceding or following prediction pictures to be of I or P type only. 

[0012] Represented in Figure 3 is a set of pictures forming a group called a GOP (Group Of Pictures) which comprises 
twelve pictures, namely an I picture followed by eleven B and P pictures according to the following succession: B. B, 
P. B, B, P, B, B. P. B. B. 

[0013] In the case of predicted pictures (that is to say those deduced from other pictures), motion estimation followed 
45 by motion compensation are applied to the macroblock to be coded. This is because, between two pictures, the mac- 
roblock may be situated at different locations by reason of the inter-picture and inter-frame motions. The effect of motion 
compensation Is to compute the prediction macroblock according to a given mode of interpolation (commonly called 
the prediction mode); this macroblock will actually serve as prediction for the source macroblock in inter-plcture mode 
for a given coding mode. Hereinafter, this prediction macroblock and, by the same token this coding mode, will be 
50 retained or rejected depending on the decisions taken within the procedure for computing the choice of the coding mode. 
[0014] Moreover, in the case of interiaced scanning, for which each picture is formed of two successive fi^mes, an 
odd frame and an even frame, it is necessary to determine whether the DCT transformation should be performed 
progressively or Individually on each frame. This is because, depending on the motion of the picture or the structure 
of this picture, the result of the coding may be different depending on whether the transformation is performed on the 
55 picture or on each frame. 

[001 5] This choice is represented by Figures 4a and 4b. Represented in Figure 4a is a macroblock 28 of an Interlaced 
picture formed of lines 30^. 3O3. ... 30^5 of an odd frame and of lines 3O2, 3O4 .... 30ie of an even frame. Figure 4a 
corresponds to a DCT transfonmation performed on the picture; each of the four blocks of the macroblock 28 Is trans- 
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formed without rearranging the lines. Thus, the coding is performed on the four blocks 26^, 282. 283, 284 forming the 
macroblock 28 and the transformation is perfomned on lines 30i to 30g for blocks 28^ and 283 and on lines 30g to 30^^ 
for blocks 283 and 284. 

[0016] On the other hand, Figure 4b represents a transformation performed separately for the odd and even frames. 
5 Blocks 32^ and 322 conrespond to the odd frame and blocks 323 and 324 to the even frame. Thus, block 32^ comprises 
lines 30^, 3O3, .... 30^5. while blocks 323 and 324 comprise lines 3O2, 3O4 30^5. 

[0017] Represented in Figure 5 is a chart in block form representing the various operations to be performed in respect 
of the picture compression or video compression. Each digitized picture Is applied to an Input of a facility 40 which 
performs the separation into 8x8 blocks and these 8x8 blocks are transmitted to a facility 42 for selecting between the 
10 intra-coding and the inter-coding. If the coding chosen is intra, the block is transmitted to the DCT transformation facility 
44. If the coding is inter, the block is subjected to a subtraction by a subtractor facility 46 which takes the differences 
between the block itself and a prediction block delivered by a time prediction facility 48. 

[0018] After the DCT transformation 44, a quantization 50 is perfonmed and the quantized coefficients thus obtained 
are coded according to a VLC coding of variable or fixed length 52. The coefficients thus coded obtained at the output 
of the coder 52 are directed to a buffer memory 54 whose output constitutes the coding output 56. To avoid saturation 
and drying up of the buffer memory 54, regulation 60 is performed which modifies the quantization 50. 
[0019] To be able to perform the time prediction, the output of the quantization facility 50 is linked to the Input of a 
facility 62 for inverse quantization Qr^ whose output is applied to the input of a facility 64 performing the inverse cosine 
transformation DCT-\ The output of the facility 64 is transmitted directly to a picture memory 66 when the coding of 
20 the block is intra, as detemnined by a facility 68. On the other hand, when the facility 68 decides that the coding is inter, 
the output of the block 64 is added, by virtue of an adder 70, to the prediction macroblock delivered by the facility 48 
and it is the output from the adder 70 which is transmitted to the picture memory 66. The memory 66 keeps decoded 
pictures. 

[0020] The inter-picture and inter-frame motions are estimated by a facility 72 which receives, on the one hand, 
25 information firom the picture memory 66 and, on the other hand, from the output from the facility 40 for constructing 
blocks. Thus, it may be seen that the time prediction 48 is performed, on the one hand, on the basis of the picture 
memory 66 and, on the other hand, of the motion estimation 72. 

[0021] The computation of the coding or binary train 52 depends, among other things, on the motion vectors arising 
from the motion estimation 72, the coded DCT coefficients, the headers of the macroblocks, and MPEG2 coding in- 
30 formation delivered by a facility 74. This infonmation relates to the MPEG2 signalling cues, namely the headers of the 
lines of macroblocks (or "slices"), the headers of the pictures of the GOPs and the headers of the sequence to be coded. 
[0022] For the pictures of I type, the coding must be chosen between frame-wise intra-coding or picture-wise intra- 
coding. This choice Is made on the basis of an analysis of the activity contained in the macroblock; it gives good results 
in general. 

35 [0023] For the pictures of P type and the pictures of B type, the number of decisions to be made is substantially larger. 
[0024] Thus, a P macroblock may be coded according to eight basic modes: 

intra; frame DCT, 

intra; picture DCT, 
40 . without motion compensation (noMC); frame DCT, 

without motion compensation (noMC); picture DCT, 

with motion compensation; prediction by earlier frames; frame DCT, 

with motion compensation; prediction by eariier frames; picture DCT, 

with motion compensation; prediction by eariier picture; frame DCT, 
45 . with motion compensation; prediction by eariier picture; picture DCT. 

[0025] For the pictures of B type, 14 basic coding modes are possible, namely two intra-coding modes, the four 
modes with motion compensation indicated with regard to the pictures of P type and. in addition, four similar coding 
modes based on the later prediction picture and 4 bidirectional modes, namely: 

50 

prediction by later frames; frame DCT, 
prediction by later frames; picture DCT, 
prediction by later picture; frame DCT, 
prediction by later picture; picture DCT, 
55 - frame-wise bidirectional prediction; frame DCT, 
frame-wise bidirectional prediction; picture DCT, 
picture-wise bidirectional prediction; frame DCT. 
picture-wise bidirectional prediction; picture DCT. 
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[0026] It has been noted that the criteria used hitherto to choose between the various modes for the P and B pictures 
gave results of variable quality. 

[0027] The invention makes It possible to deliver pictures of relatively constant quality for these P and B pictures. 
[0028] The invention also makes it possible to deliver pictures of better quality/cost ratio as compared with the pictures 

5 obtained with the known processes. 

[0029] To this end, the compression process in accordance with the invention is characterized in that, for each mac- 
roblock, a trial coding is performed according to all the possible modes or according to some of these modes, and the 
coding cost and/or a quality factor is determined for each trial coding, the coding mode used being selected as a 
function of the value of the cost of the coding and/or of the value of the quality factor. 

10 [0030] The expression "coding cost" should be understood to mean the number of bits used for each coding. 

[0031] Stated otherwise, the coding mode is not chosen on the basis of an internal analysis of the macroblocks (as 
is the case in the prior art) but by trying alt the coding possibilities (or some of them) and by retaining the mode which 
gives the best result, either because it minimizes the coding cost, or because it maximizes the quality of the picture, 
or, preferably, if it delivers the best compromise between the coding cost and the quality of the picture. 

15 [0032] The quality factor for the picture is determined by a comparison between the source macroblock and the 
decoded macroblock. The latter corresponds to the macroblock subjected to a trial coding and then decoded. 
[0033] In one embodiment, the mean quality factor is determined over a sequence of preceding pictures contained 
in a sliding time window and a coding type is selected which delivers a quality factor which is closest to this mean 
quality factor. 

20 [0034] In another embodiment, for each macroblock, the coding cost is detemiined for each coding type, the quality 
factor for the coding type ensuring minimum cost is compared with a mean value of quality factor over a sequence of 
preceding pictures contained in a sliding time window, and the coding type delivering minimum cost is selected if the 
corresponding quality factor is at least equal to the mean quality fector Increased by an increment. 
[0035] The compression process according to the invention can be used for all applications which call upon video 

25 compression of the MPEG type (MPEG1, MPEG2, MPEG4) or the like. It can be used, in particular, for the studio 
recording of records or magnetic tapes. It can also be used for real-time coding, for example in video cameras. 
[0036] Experiments performed within the context of the invention have shown that the compression according to the 
invention makes it possible, for the same quality of picture, to use around 25% fewer bits than compression carried 
out in a conventional manner. 

30 [0037] Other characteristics and advantages of the Invention will become apparent with the description of certain of 
its embodiments, this description being given with reference to the appended drawings in which: 

Figures 1 to 5, already described, represent the principle of MPEG2 compression, 

Figures 6 and 7 are charts explaining certain aspects of a compression process In accordance with the invention. 

35 

[0038] in the example of compression which will be described below, all the possible coding modes (i.e. two possible 
modes for the pictures of I type, eight basic modes for the pictures of P type and fourteen basic modes for the pictures 
of B type) are considered, for each macroblock. For each of these modes, a trial coding is performed and the following 
parameters are subsequently determined: 

40 

Cost C of coding the macroblock, that is to say the number of bits obtained at the conclusion of the trial coding. 
The quality factor MB_Qualrty of the reconstructed macroblock. How this quality factor can be determined will be 
explained below. 

The temporal mean value, denoted Seq.Quality, of the quality factor MB.Quallty over the picture or over a sequence 
45 of pictures. Knowing this parameter may enable the possible coding overcost to be limited In the situation where 

the factor MB^Quality for a coding mode under consideration appreciably exceeds the value of Seq.Quality. 
A threshold, denoted overC, which depends on the type of picture under consideration (I, P or B), representing an 
allowable overcost of coding so as to raise the quality factor MB.Quality of the macroblock. The allowable overcost 
overC for a picture of I type will be greater than the allowable overcost of coding for a picture of P or B type. 

50 

[0039] The quality factor MB.Quaiity results from a comparison between the source macroblock to be coded, and 
its reconstructed counterpart for a coding mode under consideration. In the case of a criterion of PSNR type (PSNR 
here signifying "Picture Signal to Noise Ratio"), the squares of the pointwise differences between the source luminance 
pixels and the reconstructed pixels are summed (sum) and then the value of MB_Quality is calculated as follows: 

55 

MB.Quality = -10 Log^Q (sum/Lum), 
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with: 

Lum = 255*255*256 = 16646400 

[0040] In the case of the SNR (Signal to Noise Ratio) type criterion, the value of the parameter sum is calculated in 
the sanne way, while Lum corresponds to the sum of the values of 256 pixels contained In the source luminance block, 
the value of MB.Quality is likewise calculated as follows: 

MB_Quality = -10 Log^Q (sum/Lum) 



[0041] Thus, the smaller the differences, the larger the quality factor. 

[0042] In the example, the coding mode which delivers the minimum cost Cj^^fy is firstly determined and the macrob- 
15 lock quality factor MB.Quality of minimum cost, i.e. MB.QualityC^fn, is calculated. 

[0043] Subsequently, we determine whether another coding mode, which, a priori, requires a coding overcost (as 
compared with C^in). causes an acceptable increase in the quality factor MB_Quality. 

[0044] To this end. MB.QualityC^jp is compared with Seq.Quality, this being the sequence-wise mean quality factor 
over a specified number of preceding pictures, for example 20. 
20 [0045] If: 

MB.QualltyC^j^ ^ Seq.Quality + overMB^Quality, 

25 in which formula overMB^Quality represents an acceptable value of increase in the sequence-wise quality factor, 

so the coding mode which makes it possible to obtain the minimum cost C^|„ will be chosen since broadly adequate 
quality is obtained with minimum cost. 

[0046] This situation occurs in weakly textured zones of the picture (little information to be coded) and/br those for 
which the time prediction is of good quality. 
30 [0047] If: 

MB_QualityC^n ^ Seq.Quality (1), 

that is to say if the quality obtained with minimum cost is Inadequate, then the maximum allowable overcost will 
be pennitted. 

[0048] On the other hand, in other cases, that Is to say when: 



Seq.Quality ^ MB_QualityC^j„ < Seq.Quality + overMB.Quality (2), 

an overcost of less than overC will be pemnltted, this overcost being weighted by a factor S having the following 
value: 

MB^QualityC^j^- Seq.Quality 
^ " ^ overMB_Quality 

[0049] Stated otherwise, a coding overcost overC* having the following value: 

overC' = overC x S (4) 

is permitted. 

[0050] It should be noted that, in the other cases discussed above, that is to say when: 
MB^QualityCn^jn ^ Seq.Quality -i- overMB.Quality, and when: 
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MB_Qua!ityC^,„ < Seq.Quality, 

it is also possible to regard the factor S as being assigned the values 0 and 1 respectively. Stated otherwise, the 
5 factor S takes the value zero when its value resulting from fonmula (3) is negative and this factor S takes the value 1 
when its value resulting from fonmula (3) exceeds 1. 

[0051] Under these conditions, the factor S can be represented by the chart of Figure 6 which fomns an integral part 
of the present description. 

[0052] When the possible coding overcost overC or overC' has been detemriined. the best coding cost/MB_Quality 
10 pair is sought. To this end, the following is carried out: 

[0053] MB_Quality, is compared with MB.QualityC^in, MB^Quality, being \he quality factor for the trial coding mode 
under consideration of rank I. 
[0054] If: 



IS 



20 



45 



50 



MB_Qualityj = MB_QualityC^,„. 

we keep the coding which delivered the minimum cost C^i^. 
[0055] If: 



MB_Quality, < MB_QualltyC^„, 

we also adopt the coding mode delivering the minimum cost 
25 [0056] On the other hand, if: 

MB.Qualityi > MB_QualltyC^jn. 

30 we choose the coding mode by considering the following parameter 

di = Cj - [C„j„ + overC*. (MB.Quality, - MB_QualityC^,„)] (5). 

35 [0057] In this fomiula, Cj Is the cost of the current macroblock for the trial coding represented by the index i and 
overC is the allowable overcost (dependent on over_C and on S). 

[0058] dl therefore represents the distance between the cost of the current macroblock and the maximum possible 
cost. 

[0059] The coding mode delivering the smallest signed value of di will be chosen. 
^ [0060] This criterion can be represented by the chart of Figure 7 which forms an integral part of the present descrip- 
tion. In this chart, the quality factor MB.Quality has been plotted along the abscissa and the cost Cj of the current 
macroblock along the ordinate, the value C^in corresponding to the origin. The straight line 80 corresponds to the 
maximum possible cost with regard to the increase In the value of MB.Quality. Stated othenwse, no cost^B_Quality 
pair (having regard to the parameters over_C and S) which lies above the straight line 80 will be chosen. 
[0061] The process described uses the parameter Seq.Quality. The latter may reasonably be calculated and used 
only once the steady state has been established within the coder, that is to say outside of the transient periods, for 
example beyond the fiftieth or hundredth coded picture of a sequence, either when the coder is set going, or after 
detecting a "cut" (change of sequence or of scene). Accordingly, in the coding decision procedure, during the transient 
periods, the parameter S takes the value 1 and, consequently, overC' = overC; stated otherwise, any overcost Is limited 
to overC. The detecting of a cut is a piece of Information which may be delh/ered by one of the components constituting 
the assembly of the picture coder. This component may be the motion estimator. 

[0062] In the process described hitherto, all the coding modes are tried. In a simplified variant, to reduce the amount 
of calculations required, a more limited number of possibilities is tried. In one example, the choice of the parameter 
corresponding to the application of the DCT transformation to a frame or to a picture is excluded from the trial. To this 
55 end, this coding mode is chosen in the conventional manner. For example, local activity of the information contained 
in the macroblock is evaluated and, if the activity of the macroblock picture-wise Is less than the activity of the mac- 
roblock frame-wise, a picture-wise DCT transformation is chosen and, In the contrary case, a frame^se DCT trans- 
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formation is chosen. 

[0063] In this way, the catculations to be performed for the other choices are reduced, since the number of coding 
possibilities is then halved. 

[0064] In the example mentioned above, in respect of which no trial coding is performed to determine whether the 
5 DCT transformation is carried out picture-wise or frame-wise, for the pictures of I type, the process for detemnining the 
coding mode is conventional, while for the pictures of P type, the number of basic coding modes is reduced to four and 
for the pictures of B type, the number of basic codings is reduced to seven. 

[0065] The invention is not of course limited to the example described. In particular, it is not limited to the basic coding 
modes; it encompasses any other coding type such as. for example, the "dual prime" mode for the P pictures of the 
10 MPEG2 standard. 

Claims 

15 1 . Picture compression process in which each picture macroblock is subjected to a coding chosen, for each macrob- 
lock, from among several coding types, characterized in that, to select the coding type to be applied to the mac- 
roblock, the cost (C{) and/or a quality factor (MB.Quality) is determined, by trial codings, for each coding type, the 
selection being made as a function of the values of the cost and the quality factor. 

20 2. Process according to Claim 1, characterized in that the quality factor for each macroblock is determined by a 
comparison between the source macroblock and the decoded macroblock corresponding to the macroblock sub- 
jected to a trial coding. 



25 



3. Process according to Claim 1 or 2, characterized in that the coding type chosen is the one ensuring minimum cost 

4. Process according to any one of Claims 1 to 3, characterized In that a mean quality factor is determined over a 
sequence of preceding pictures (Seq.Quality) contained in a sliding time window and in that a coding type is se- 
lected which delivers a quality factor (MB.Quality) which is closest to this mean quality factor. 

30 5. Process according to any one of the preceding claims, characterized in that, for each macroblock, the coding cost 
is detennlned for each coding type, the quality factor (MR_QualityC^^ for the coding type ensuring minimum cost 
(Cj^in) Is compared with a mean value of quality factor (Seq.Quality) over a sequence of preceding pictures con- 
tained In a sliding time window, and the coding type delivering minimum cost Is selected if the corresponding quality 
factor is at least equal to the mean quality factor (Seq.Quality) Increased by an increment (overMB.Quality). 

35 

6. Process according to Claim 5, characterized in that, if the quality factor (MR_QualityC;„j^ for the coding type 
ensuring minimum cost (C^i^) is less than the mean quality factor (Seq.Quality) a coding type is chosen which 
gives rise to a coding overcost which does not exceed a specified value (overC) and which ensures an acceptable 
increase in the quality factor. 

40 

7. Process according to Claim 6, characterized in that a factor for weighting the overcost (overC) is determined by 
the following formula: 

overMB^Quatity 

this weighting factor being held at zero when its value obtained through the above formula Is negative and 
being held at 1 when the value resulting from this formula is greater than 1 . 



50 



55 



8. Process according to Claim 6 or 7, characterized in that the coding type is determined by minimizing the signed 
parameter di defined by the following formula: 

di = C, - [C„,n + overC.(MB_Quality, - MB_QualityC„,„)] 

Cj being the cost of the trialed coding, C^^i^ the minimum cost of coding from among all the trialed coding 
modes. MB^Qualityj the quality factor for the trialed coding, and MB.QualityC^nin ^® quality factor conresponding 



7 



EP 0 982 951 A1 

to the trialed coding of minimum cost. 

9. Process according to any one of the preceding claims, characterized in that the compression is of the MPEG type, 
preferably of the MPEG2 type. 

5 

1 0. Process according to Claim 9. characterized in that it is used to choose the coding type for the macrobloclts of the 
pictures of I, P and B types. 

11. Process according to Claim 9, characterized in that it is used for the coding type of the pictures of P and B type 
10 and not for the coding type of the pictures of I type. 

12. Process according to Claim 11, characterized in that, to detennine whether the cosine transformation is performed 
on a frame macroblock or on a picture macroblock, a macroblock-related parameter is determined before coding 
and the type of transfomnation is chosen as a function of this parameter. 

15 
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